-
Notifications
You must be signed in to change notification settings - Fork 25k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Analysis enhancement - better plural stemmer than minimal_english. #43248
base: main
Are you sure you want to change the base?
Conversation
Pinging @elastic/es-search |
|
||
// *CHES - would be good to find a simple rule that solves lunches, churches but doesn't break aches | ||
// documenting current behaviour here as a known issue: | ||
assertAnalyzesTo(analyzer, "lunches", new String[]{"lunche"}); |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Looking at http://www.thefreedictionary.com/words-that-end-in-ches I think that 'avalanche' and 'headache' are the odd ones out here - consider brooch/brooches, branch/branches, couch/couches. Maybe have a specific rule for -ache
, and some exceptions?
Drops the trailing “e” in taxes, dresses, watches etc that otherwise cause mismatches with plural and singular forms Closes elastic#42892
Made ies->y stemming stricter so short words match eg ties==tie Removed special-case code for crazy-rare words iaes and eies
…nes-> oe e.g. shoes
What's the status of this PR @markharwood ? Is it still relevant ? |
Pinging @elastic/es-search (Team:Search) |
Pinging @elastic/es-search-relevance (Team:Search Relevance) |
Drops the trailing “e” in taxes, dresses, watches, dishes etc that otherwise cause mismatches with plural and singular forms.
See the issue for recall benchmarking results on typical data.
Closes #42892